Abstract

Background: Two distinctly different types of measurement error are Berkson and classical. Impacts of 
measurement error in epidemiologic studies of ambient air pollution are expected to depend on error type. We 
characterize measurement error due to instrument imprecision and spatial variability as multiplicative (i.e. additive 
on the log scale) and model it over a range of error types to assess impacts on risk ratio estimates both on a per 
measurement unit basis and on a per interquartile range (IQR) basis in a time-series study in Atlanta. 

Methods: Daily measures of twelve ambient air pollutants were analyzed: NO2, NOx, O3, SO2, CO, PM10 mass, PM2.5 
mass, and PM2.5 components sulfate, nitrate, ammonium, elemental carbon and organic carbon. Semivariogram 
analysis was applied to assess spatial variability. Error due to this spatial variability was added to a reference 
pollutant time-series on the log scale using Monte Carlo simulations. Each of these time-series was exponentiated 
and introduced to a Poisson generalized linear model of cardiovascular disease emergency department visits. 

Results: Measurement error resulted in reduced statistical significance for the risk ratio estimates for all amounts 
(corresponding to different pollutants) and types of error. When modelled as classical-type error, risk ratios were 
attenuated, particularly for primary air pollutants, with average attenuation in risk ratios on a per unit of 
measurement basis ranging from 18% to 92% and on an IQR basis ranging from 18% to 86%. When modelled as 
Berkson-type error, risk ratios per unit of measurement were biased away from the null hypothesis by 2% to 31%, 
whereas risk ratios per IQR were attenuated (i.e. biased toward the null) by 5% to 34%. For CO modelled error 
amount, a range of error types were simulated and effects on risk ratio bias and significance were observed. 

Conclusions: For multiplicative error, both the amount and type of measurement error impact health effect 
estimates in air pollution epidemiology. By modelling instrument imprecision and spatial variability as different 
error types, we estimate direction and magnitude of the effects of error over a range of error types. 



Background 

The issue of measurement error is unavoidable in epide- 
miologic studies of air pollution [1]. Although methods 
for dealing with this measurement error have been pro- 
posed [2,3] and applied to air pollution epidemiology 
specifically [4,5], the issue remains a central concern in 
the field [6]. Because large-scale time-series studies 
often use single central monitoring sites to characterize 
community exposure to ambient concentrations [7], 
uncertainties arise regarding the extent to which these 
monitors are representative of exposure. Zeger et al. [8] 
identify three components of measurement error: (1) the 
difference between individual exposures and average 
personal exposure, (2) the difference between average 
personal exposure and ambient levels, and (3) the differ- 
ence between measured and true ambient concentra- 
tions. While the former two components of error can 
have a sizeable impact on epidemiologic findings that 
address etiologic questions of health effects and personal 
exposure, it is the third component that is particularly 
relevant in time-series studies that address questions of 
the health benefits of ambient regulation [9]. 

Prior studies have suggested that the impact of mea- 
surement error on time-series health studies differs 
depending upon the type of error introduced [8,10,11]. 
Two distinctly different types of error have been identified.
One type is classical error, in which measurements, $Z_t$,
vary randomly about true concentrations, $Z_t^*$; this can be
considered the case for instrument error associated with
ambient monitors. That is, instrument error is independent of
the true ambient level, such that $E[Z_t \mid Z_t^*] = Z_t^*$.
Moreover, the variation in the measurements, $Z_t$, is
expected to be greater than the variation in the true values,
$Z_t^*$. Therefore, classical error is expected to attenuate
the effect estimate in time-series epidemiologic studies. In
contrast, under a Berkson error framework, the true ambient
level, $Z_t^*$, varies randomly about the measurement, $Z_t$.
This might be the case, for example, of a measured population
average over the study area with true individual ambient
levels varying randomly about this population average
measurement. In this case, measurement error is independent
of the measured population average ambient level; that is,
$E[Z_t^* \mid Z_t] = Z_t$. Furthermore, the measurement,
$Z_t$, is less variable than the true ambient level, $Z_t^*$.
A purely Berkson error is expected to yield an unbiased
effect estimate, provided that the true dose-response is
linear [3].
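The contrast between the two error types can be illustrated with a small simulation. The sketch below is not part of the study: it assumes additive, normally distributed errors on an unlogged scale, a hypothetical linear dose-response with slope `TRUE_SLOPE`, and ordinary least squares in place of the Poisson model used later. All variable names are illustrative.

```python
import random

def ols_slope(x, y):
    """Least-squares slope of y regressed on x."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

rng = random.Random(1)
N = 20000
TRUE_SLOPE = 2.0  # hypothetical dose-response slope (assumption)

# Classical error: measurement = truth + noise, noise independent of the truth.
truth_c = [rng.gauss(10.0, 1.0) for _ in range(N)]
meas_c = [z + rng.gauss(0.0, 1.0) for z in truth_c]
resp_c = [TRUE_SLOPE * z + rng.gauss(0.0, 0.5) for z in truth_c]

# Berkson error: truth = measurement + noise, noise independent of the measurement.
meas_b = [rng.gauss(10.0, 1.0) for _ in range(N)]
truth_b = [z + rng.gauss(0.0, 1.0) for z in meas_b]
resp_b = [TRUE_SLOPE * z + rng.gauss(0.0, 0.5) for z in truth_b]

# The response always depends on the truth; the regression only sees the measurement.
slope_classical = ols_slope(meas_c, resp_c)  # attenuated toward zero
slope_berkson = ols_slope(meas_b, resp_b)    # close to TRUE_SLOPE
print(round(slope_classical, 2), round(slope_berkson, 2))
```

With equal truth and noise variances, the classical slope should land near half of `TRUE_SLOPE`, while the Berkson slope should stay near `TRUE_SLOPE`.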

Several studies have investigated the impact of error 
type on regression models. The simultaneous impact of 
classical and Berkson errors in a parametric regression 
estimating radon exposure has been investigated [12] 
and error type has been assessed in a semiparametric 
Bayesian setting looking at exposure to radiation from 
nuclear testing [13,14]; however, no study to date has 
comprehensively assessed the impact of error type 
across multiple pollutants for instrument imprecision 
and spatial variability in a time-series context. 

Error type depends on the relationship between the 
distribution of measurements and the distribution of 
true values. Because true relevant exposure in environ- 
mental epidemiologic studies is not known exactly, 
determination of error type is challenging; thus, here we 
examine the impact of error modelled as two distinctly 
different types: classical and Berkson. First, we examine 
monitor data to assess whether error is better modelled 
on a logged or unlogged basis. Typically, researchers 
investigating error type have added error on an 
unlogged basis (e.g. [8,11]); however, air pollution data 
are more often lognormal due to atmospheric dynamics 
and concentration levels that are never less than zero. It 
is plausible that true ambient exposures are distributed 
lognormally about a population average as well; there- 
fore, measurement error may be best described as addi- 
tive error on the log scale. We investigate the combined 



error from two sources that have been previously identi- 
fied as relevant in time-series studies: (1) instrument 
precision error and (2) error due to spatial variability 
[9]. We limit our scope to ambient levels of pollutants 
measured in accordance with regulatory specifications, 
disregarding spatial microscale variability, such as near 
roadway concentrations, as well as temporal microscale 
variability, such as that associated with meteorological 
events on sub-hour time scales. Here, building on a pre- 
viously developed model for the amount of error asso- 
ciated with selected ambient air pollutants [15], we 
quantitatively assess the effect of error type on the 
impacts of measurement error on epidemiologic results 
from an ongoing study of air pollution and emergency 
department visits in Atlanta. 

Methods 

Air Pollutant Data 

Daily metrics of 12 ambient air pollutants were studied: 
1-hr maximum NO2, NOx, SO2 and CO, 8-hr maximum 
O3, and 24-hr average PM10, PM2.5 and PM2.5 components
sulfate (SO4), nitrate (NO3), ammonium (NH4), 
elemental carbon (EC) and organic carbon (OC). Obser- 
vations were obtained from three monitoring networks: 
the US EPA's Air Quality System (AQS), including State 
and Local Air Monitoring System and Speciation Trends 
Network for PM2.5 component measurements; the 
Southeastern Aerosol Research and Characterization 
Study (SEARCH) network [16], including the Atlanta 
EPA supersite at Jefferson Street [17]; and the Assess- 
ment of Spatial Aerosol Composition in Atlanta 
(ASACA) network [18]. Locations of the monitoring 
sites are shown in Figure 1. 

To assess error due to instrument imprecision and 
spatial variability of ambient concentrations, 1999-2004 
datasets were used for the 12 pollutants with data com- 
pleteness for this time period (2,192 days) ranging from 
82% to 97%. Data from collocated instruments were 
used to characterize instrument precision error. Mea- 
surement methods and data quality are discussed in 
detail in our prior work [15]. Distributions of all air pol- 
lutant measures more closely approximate lognormal 
distributions than normal distributions ([19], see Additional
file 1, Table S1); therefore, additive error was 
characterized and modeled on a log concentration basis 
so that simulations with error added to a base case 
time-series would also have lognormal distributions. 

Figure 1 Map of 20-county metropolitan Atlanta study area. Census tracts, expressways, and ambient air pollutant monitoring sites are 
shown. 

Measurement Error Model 

The measurement error model description here highlights
differences from our previous work in which error type
effects were not addressed [15]. In this study, a time-series
of observed data was taken to be the "true" time-series,
$Z_t^*$, serving as a base case. Classical-like or
Berkson-like error was added to this base case to produce a
simulated time-series, $Z_t$, that represents a
population-weighted average ambient time-series. Here, the
asterisk refers to a true value (i.e. without error) as
opposed to a value that contains error (i.e. the simulated
values in this study). The choice of which pollutant to use
for the true, or base case, time-series is arbitrary, as long
as an association with a health endpoint has been observed
with that pollutant. To develop simulated datasets with
modeled instrument and spatial error added, the following
steps were taken. Base case time-series data were normalized
as follows. 



$$\chi_t^* = \frac{\ln Z_t^* - \mu_{\ln Z^*}}{\sigma_{\ln Z^*}} \tag{1}$$



Here, $\chi_t^*$ is the normalized log concentration on day
$t$ and $\mu_{\ln Z^*}$ and $\sigma_{\ln Z^*}$ are the mean
and standard deviation, respectively, of the log
concentrations over all days $t$; thus, the mean and standard
deviation of $\chi_t^*$ are 0 and 1, respectively. Error in
$\chi_t^*$ was modeled as multiplicative (i.e. additive on a
log scale) as follows. 

$$\varepsilon_{\chi t} = \sigma_{err} N_t \tag{2}$$

Here, $\varepsilon_{\chi t}$ is the modeled error in
$\chi_t^*$ for day $t$, $N_t$ is a random number with
distribution ~$N(0, 1)$ and $\sigma_{err}$ is the standard
deviation of error added, a parameter derived from the
population-weighted semivariance to capture the amount of
error present for each pollutant, as described in the next
subsection. Short-term temporal autocorrelation observed in
the differences between measurements was modeled using a
three-day running average of random numbers for $N_t$ [15]. 
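The normalization and error-generation steps (eqs. 1 and 2) can be sketched as follows. This is a minimal illustration, not the authors' code: the base case series is synthetic, the function names are hypothetical, and the rescaling of the three-day running average by the square root of 3 is an assumption made here so that the averaged numbers keep unit variance (the text does not spell out that detail).

```python
import math
import random

def normalize_log(z_true):
    """Eq. 1: standardize log concentrations to mean 0 and standard deviation 1."""
    logs = [math.log(z) for z in z_true]
    n = len(logs)
    mu = sum(logs) / n
    sigma = math.sqrt(sum((v - mu) ** 2 for v in logs) / n)
    chi = [(v - mu) / sigma for v in logs]
    return chi, mu, sigma

def autocorrelated_noise(n, rng):
    """Three-day running average of N(0,1) draws.

    Assumption: the average is multiplied by sqrt(3) so that the
    resulting N_t still has a standard deviation of 1.
    """
    raw = [rng.gauss(0.0, 1.0) for _ in range(n + 2)]
    return [math.sqrt(3.0) * (raw[i] + raw[i + 1] + raw[i + 2]) / 3.0
            for i in range(n)]

def modeled_error(n, sigma_err, rng):
    """Eq. 2: error on the normalized log scale."""
    return [sigma_err * v for v in autocorrelated_noise(n, rng)]

rng = random.Random(0)
# Synthetic lognormal base case over 2,192 days (hypothetical data).
z_true = [math.exp(rng.gauss(0.0, 0.6)) for _ in range(2192)]
chi_true, mu_ln, sigma_ln = normalize_log(z_true)
eps = modeled_error(len(chi_true), 1.18, rng)  # sigma_err = 1.18 is the CO value in Table 1
```

The running average gives neighbouring days correlated errors, which is the property the autocorrelation step is meant to capture.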

To provide simulations of monitor data with error added
($Z_t$), the modeled error was added to normalized data and
then the normalized data with error added were denormalized
in two ways: one to simulate classical-like error (i.e.
classical error on a log concentration basis, referred to
here as type C error) and the other to simulate Berkson-like
error (i.e. Berkson error on a log concentration basis,
referred to here as type B error). Simulations with type C
error are generated by eq. 3. 



$$\text{type C error:} \quad \chi_t = \chi_t^* + \varepsilon_{\chi t} \tag{3}$$



Here, $\chi_t$ is the standardized simulated time-series (on
the log scale) with type C error added and normal
distribution ~$N\left(0, \sqrt{1 + \sigma_{err}^2}\right)$.
In this case of type C error, $\varepsilon_{\chi t}$ and
$\chi_t^*$ are independent (i.e.
$E[R(\varepsilon_{\chi t}, \chi_t^*)] = 0$). For type B
error, $\varepsilon_{\chi t}$ and $\chi_t$ are independent
(i.e. $E[R(\varepsilon_{\chi t}, \chi_t)] = 0$) and
$\chi_t^* = \chi_t + \varepsilon_{\chi t}$. It can be shown
(see Additional file 2, eqs. S1-S6) that simulations with
type B error can be generated from the true time-series by
eq. 4. 



$$\text{type B error:} \quad \chi_t = \frac{\chi_t^* + \varepsilon_{\chi t}}{1 + \sigma_{err}^2} \tag{4}$$



Here, $\chi_t$ is the standardized simulated time-series (on
the log scale) with type B error added and normal
distribution
~$N\left(0, 1/\sqrt{1 + \sigma_{err}^2}\right)$. After the
standardized simulated time-series is generated by either
eq. 3 or eq. 4, the simulations are denormalized by eq. 5. 



$$Z_t = \exp\left(\chi_t \, \sigma_{\ln Z^*} + \mu_{\ln Z^*}\right) \tag{5}$$



For both error types, the simulated time-series ($Z_t$) and
true time-series ($Z_t^*$) have the same log means
($\mu_{\ln Z} = \mu_{\ln Z^*}$). For classical-like error
(type C), the log standard deviation is greater for the
simulated time-series than the true time-series
($\sigma_{\ln Z} > \sigma_{\ln Z^*}$) because the simulated
values are scattered about the true values. For Berkson-like
error (type B), the log standard deviation is less for the
simulated time-series than the true time-series
($\sigma_{\ln Z} < \sigma_{\ln Z^*}$) because the true values
are scattered about the simulated values. 
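The two simulation routes (eqs. 3 to 5) and the properties just described can be checked numerically. The sketch below assumes the type B form $\chi_t = (\chi_t^* + \varepsilon_{\chi t})/(1 + \sigma_{err}^2)$, uses independent errors (no running average) for brevity, and picks a hypothetical log mean and log standard deviation for the base case; function names are illustrative.

```python
import math
import random

def sd(values):
    m = sum(values) / len(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / len(values))

def corr(x, y):
    """Pearson correlation coefficient."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / (len(x) * sd(x) * sd(y))

def simulate(chi_true, eps, sigma_err, error_type):
    """Eq. 3 (type C) or eq. 4 (type B) on the normalized log scale."""
    if error_type == "C":
        return [c + e for c, e in zip(chi_true, eps)]
    if error_type == "B":
        return [(c + e) / (1.0 + sigma_err ** 2) for c, e in zip(chi_true, eps)]
    raise ValueError("error_type must be 'C' or 'B'")

def denormalize(chi, mu_ln, sigma_ln):
    """Eq. 5: back-transform to concentration units."""
    return [math.exp(c * sigma_ln + mu_ln) for c in chi]

rng = random.Random(42)
n = 50000
sigma_err = 1.18             # CO value from Table 1
mu_ln, sigma_ln = -0.5, 0.6  # hypothetical log mean and log standard deviation
chi_true = [rng.gauss(0.0, 1.0) for _ in range(n)]
eps = [sigma_err * rng.gauss(0.0, 1.0) for _ in range(n)]

chi_c = simulate(chi_true, eps, sigma_err, "C")
chi_b = simulate(chi_true, eps, sigma_err, "B")
z_c = denormalize(chi_c, mu_ln, sigma_ln)
z_b = denormalize(chi_b, mu_ln, sigma_ln)

ratio_c = sd(chi_c) / sd(chi_true)  # should be near sqrt(1 + sigma_err^2)
ratio_b = sd(chi_b) / sd(chi_true)  # should be near 1 / sqrt(1 + sigma_err^2)
# Type C error is uncorrelated with the truth; type B error with the simulation.
r_c = corr([s - t for s, t in zip(chi_c, chi_true)], chi_true)
r_b = corr([s - t for s, t in zip(chi_b, chi_true)], chi_b)
```

Dividing by $1 + \sigma_{err}^2$ is what makes the remaining error uncorrelated with the simulated series rather than with the true series, which is the defining Berkson property.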

Semivariogram Analysis 

To quantify the amount of error (i.e. $\sigma_{err}$) due to
instrument imprecision and spatial variability to add to the
simulated time-series for each pollutant (eq. 2), we made use
of the geostatistical tool of the semivariogram, which
provides information on spatial autocorrelation of data and
has proved useful in air pollution applications [20,21].
Here, the semivariance of the differences between normalized
observations ($\chi_k$ and $\chi_l$) at two locations ($k$
and $l$) located a distance $h$ apart is normalized by the
temporal variance (variation over the time-series of
observations) of the average of two normalized observations
to yield a scaled semivariance, $\gamma'$. It can be shown
that this scaled semivariance (i.e. the semivariance of
normalized values) is related to the Pearson correlation
coefficient ($R$) between normalized observations from two
monitors as follows [21]. 



$$\gamma'(h) \equiv \frac{\mathrm{Var}(\chi_k - \chi_l)}{\mathrm{Var}(\chi_k + \chi_l)} = \frac{1 - R(h)}{1 + R(h)} \tag{6}$$



Thus, $\gamma'$ represents the spatial semivariance scaled to a 
quantity indicative of the range of exposures over which 
health risk is being assessed; it is unitless and allows for 
comparison across pollutants. A scaled semivariance 
value of 0 corresponds to perfectly correlated observa- 
tions (R = 1) and a value of 1 corresponds to perfectly 
uncorrelated observations (R = 0). 
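The link between the scaled semivariance and the correlation coefficient can be verified on two synthetic monitor series. The sketch assumes that the scaled semivariance is computed as the variance of the difference divided by the variance of the sum of two normalized series, which reproduces $(1 - R)/(1 + R)$ exactly; the monitor data and function names are hypothetical.

```python
import math
import random

def standardize(values):
    n = len(values)
    m = sum(values) / n
    s = math.sqrt(sum((v - m) ** 2 for v in values) / n)
    return [(v - m) / s for v in values]

def variance(values):
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / len(values)

def pearson_r(x, y):
    # For standardized inputs the correlation is the mean cross-product.
    return sum(a * b for a, b in zip(x, y)) / len(x)

def scaled_semivariance(chi_k, chi_l):
    """Var(difference) / Var(sum) of two normalized series (assumed form)."""
    diff = [a - b for a, b in zip(chi_k, chi_l)]
    total = [a + b for a, b in zip(chi_k, chi_l)]
    return variance(diff) / variance(total)

rng = random.Random(7)
# Two hypothetical monitors sharing a common signal plus independent noise.
common = [rng.gauss(0.0, 1.0) for _ in range(2000)]
mon_k = standardize([c + rng.gauss(0.0, 0.8) for c in common])
mon_l = standardize([c + rng.gauss(0.0, 0.8) for c in common])

r = pearson_r(mon_k, mon_l)
gamma = scaled_semivariance(mon_k, mon_l)
print(round(gamma, 4), round((1.0 - r) / (1.0 + r), 4))
```

The two printed numbers agree to rounding, and the limits match the text: the ratio is 0 when R = 1 and 1 when R = 0.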



Correlations between observations from all pairs of 
monitors measuring the same pollutant during 1999- 
2004 were calculated on a log concentration basis. 
Assuming the spatial variation of air pollutants to be 
isotropic, scaled semivariograms were constructed and 
modeled as a function of the distance between observa- 
tions, h, using a sill of 1, nugget values derived from 
collocated measurement time-series described in pre- 
vious work, and least squares regression to determine 
the range [15]. The estimate from the semivariogram 
function for each of the 660 Census tracts was weighted 
by the population in that tract (estimates from 2000 
Census data) to derive an overall population-weighted 
average for each pollutant; thus, the population- 
weighted semivariance includes impacts of both instru- 
ment imprecision and spatial variability and represents 
the population-weighted average semivariance between 
all residences in the study area. 



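$$\bar{\gamma}' = \frac{1}{2 \cdot 660 \cdot P_{total}} \sum_{i=1}^{660} \sum_{j=1}^{660} P_{ij} \, \gamma'_{ij} \tag{7}$$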



Here, $\bar{\gamma}'$ is the population-weighted average
scaled semivariance on a log scale, $P_{total}$ is the total
population of the study area, $P_{ij}$ is the sum of
population in census tracts $i$ and $j$, and $\gamma'_{ij}$
is the value of the semivariance function at the distance
between centroids of census tracts $i$ and $j$. For
within-tract resident pairs, an average distance between
residences was applied. Semivariograms for each of the twelve
pollutants studied have been shown previously [15] and
population-weighted 
semivariances are in Table 1. 

Table 1 Population-weighted scaled semivariances, $\bar{\gamma}'$, Pearson correlation coefficients, $\bar{R}$, and model parameters used in the Monte Carlo simulations to simulate amount of error ($\sigma_{err}$) and error type ($\sigma_{\ln Z}/\sigma_{\ln Z^*}$)

| Pollutant | $\bar{\gamma}'$ | $\bar{R}$ | $\sigma_{err}$ | $\sigma_{\ln Z}/\sigma_{\ln Z^*}$, Type B | $\sigma_{\ln Z}/\sigma_{\ln Z^*}$, Type C |
|---|---|---|---|---|---|
| 1-hr max NO2 | 0.516 | 0.320 | 1.46 | 0.57 | 1.77 |
| 1-hr max NOx | 0.384 | 0.445 | 1.12 | 0.67 | 1.50 |
| 8-hr max O3 | 0.051 | 0.903 | 0.33 | 0.95 | 1.05 |
| 1-hr max SO2 | 0.517 | 0.319 | 1.46 | 0.56 | 1.77 |
| 1-hr max CO | 0.411 | 0.418 | 1.18 | 0.65 | 1.55 |
| 24-hr PM10 | 0.192 | 0.678 | 0.69 | 0.82 | 1.21 |
| 24-hr PM2.5 | 0.100 | 0.819 | 0.47 | 0.90 | 1.11 |
| 24-hr PM2.5-SO4 | 0.068 | 0.873 | 0.38 | 0.93 | 1.07 |
| 24-hr PM2.5-NO3 | 0.140 | 0.754 | 0.57 | 0.87 | 1.15 |
| 24-hr PM2.5-NH4 | 0.149 | 0.741 | 0.59 | 0.86 | 1.16 |
| 24-hr PM2.5-EC | 0.337 | 0.495 | 1.01 | 0.70 | 1.42 |
| 24-hr PM2.5-OC | 0.175 | 0.702 | 0.65 | 0.84 | 1.19 |

The population-weighted semivariance is related to the
population-weighted correlation coefficient as follows. 

$$\bar{\gamma}' = \frac{1 - \bar{R}}{1 + \bar{R}} \tag{8}$$
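A population-weighted average of this kind can be sketched for a toy study area. Everything specific below is an assumption made for illustration: the exponential semivariogram shape, the nugget and range values, the three made-up tracts, and the normalization by the sum of the pair weights. The source specifies only a sill of 1, a nugget from collocated instruments, a fitted range, and weights based on the summed population of each tract pair.

```python
import math

def semivariogram(h_km, nugget, range_km):
    """Hypothetical exponential semivariogram with a sill of 1."""
    return nugget + (1.0 - nugget) * (1.0 - math.exp(-3.0 * h_km / range_km))

def population_weighted_semivariance(tracts, nugget, range_km, within_km):
    """Average the semivariance over all tract pairs, weighted by P_i + P_j.

    tracts: list of (x_km, y_km, population) tuples for tract centroids.
    within_km: assumed average distance between residences in the same tract.
    """
    num = 0.0
    den = 0.0
    for i, (xi, yi, pop_i) in enumerate(tracts):
        for j, (xj, yj, pop_j) in enumerate(tracts):
            h = within_km if i == j else math.hypot(xi - xj, yi - yj)
            weight = pop_i + pop_j
            num += weight * semivariogram(h, nugget, range_km)
            den += weight
    return num / den

def correlation_from_semivariance(gamma_bar):
    """Invert eq. 8: R = (1 - gamma') / (1 + gamma')."""
    return (1.0 - gamma_bar) / (1.0 + gamma_bar)

# Three made-up tracts: (x in km, y in km, population).
tracts = [(0.0, 0.0, 4000), (5.0, 0.0, 2500), (0.0, 12.0, 6000)]
gamma_bar = population_weighted_semivariance(
    tracts, nugget=0.05, range_km=60.0, within_km=1.0)
r_bar = correlation_from_semivariance(gamma_bar)
print(round(gamma_bar, 3), round(r_bar, 3))
```

Applied to the CO value in Table 1, `correlation_from_semivariance(0.411)` gives a correlation of about 0.42, in line with the tabulated coefficient.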



Model parameter $\sigma_{err}$ (eq. 2) is defined to provide
simulations with an amount of error such that
$E[R(\ln Z, \ln Z^*)] = \sqrt{\bar{R}}$, where $\bar{R}$ is
obtained from semivariogram analysis (eqs. 6-8). The
correlation between the true ambient time-series and a
time-series with error added, i.e. $R(\ln Z, \ln Z^*)$, is
the square root of the correlation between any two
time-series, i.e. $R(\ln Z_1, \ln Z_2)$, where each is
derived by adding the same amount of error to the true
ambient time-series. Since the standard deviation of
$\chi_t$ depends on $\sigma_{err}$, the standard deviation of
the simulated time-series relative to that of the true
time-series ($\sigma_{\ln Z}/\sigma_{\ln Z^*}$) depends on
$\sigma_{err}$ as well. The following analytical
relationships for $\sigma_{err}$ and
$\sigma_{\ln Z}/\sigma_{\ln Z^*}$ were derived (see
Additional file 2, eqs. S7-S10). 

$$\sigma_{err} = \sqrt{\frac{2\bar{\gamma}'}{1 - \bar{\gamma}'}} = \sqrt{\frac{1 - \bar{R}}{\bar{R}}} \tag{9}$$

$$\frac{\sigma_{\ln Z}}{\sigma_{\ln Z^*}} = \sqrt{\frac{1 + \bar{\gamma}'}{1 - \bar{\gamma}'}} = \frac{1}{\sqrt{\bar{R}}} > 1 \ \text{for error type C}; \qquad \frac{\sigma_{\ln Z}}{\sigma_{\ln Z^*}} = \sqrt{\frac{1 - \bar{\gamma}'}{1 + \bar{\gamma}'}} = \sqrt{\bar{R}} < 1 \ \text{for error type B} \tag{10}$$

Values of $\sigma_{err}$ and $\sigma_{\ln Z}/\sigma_{\ln Z^*}$
used here can be found in Table 1. 
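As a consistency check, the model parameters in Table 1 can be recomputed from the population-weighted semivariance alone. The sketch below assumes the relationships $\bar{R} = (1 - \bar{\gamma}')/(1 + \bar{\gamma}')$, $\sigma_{err} = \sqrt{2\bar{\gamma}'/(1 - \bar{\gamma}')}$ and $\sigma_{\ln Z}/\sigma_{\ln Z^*} = \sqrt{(1 \pm \bar{\gamma}')/(1 \mp \bar{\gamma}')}$; the function name is illustrative.

```python
import math

def error_parameters(gamma_bar):
    """Return (R, sigma_err, ratio for type B, ratio for type C)."""
    r_bar = (1.0 - gamma_bar) / (1.0 + gamma_bar)               # eq. 8
    sigma_err = math.sqrt(2.0 * gamma_bar / (1.0 - gamma_bar))  # eq. 9
    ratio_c = math.sqrt((1.0 + gamma_bar) / (1.0 - gamma_bar))  # eq. 10, type C
    ratio_b = math.sqrt((1.0 - gamma_bar) / (1.0 + gamma_bar))  # eq. 10, type B
    return r_bar, sigma_err, ratio_b, ratio_c

# Semivariances taken from Table 1.
for name, gamma_bar in [("NO2", 0.516), ("CO", 0.411), ("O3", 0.051)]:
    r_bar, sigma_err, ratio_b, ratio_c = error_parameters(gamma_bar)
    print(f"{name}: R={r_bar:.3f} sigma_err={sigma_err:.2f} "
          f"B={ratio_b:.2f} C={ratio_c:.2f}")
```

Small differences in the third decimal of the correlation coefficient are expected, because the tabulated semivariances are themselves rounded.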

Sets of 1000 simulated time-series with instrument 
and spatial error added for each pollutant for the sce- 
narios of C and B error types were produced for the six- 
year period 1999-2004. In addition, simulations of CO 
measurement error only were generated for a range of 
error types with $\sigma_{\ln Z}/\sigma_{\ln Z^*}$ values
between error types C and B. Scatterplots demonstrate that C
and B error types defined on a log basis (i.e.
$\ln Z - \ln Z^*$) are independent of $\ln Z^*$ and $\ln Z$,
respectively (see Additional file 3, Figure S1). 

Epidemiologic Model 

Relationships between daily measures of ambient air 
pollution and daily counts of emergency department 
(ED) visits for cardiovascular disease (CVD, including 
ischemic heart disease, dysrhythmia, congestive heart 
failure, and peripheral/cerebrovascular disease) were 
assessed using methods described elsewhere [22] and 
briefly summarized here. There were 166,950 ED visits 
for CVD in the 20-county metropolitan Atlanta area 
during 1999-2004. Lag 0 associations between daily 



pollutant concentration and the daily count of ED visits 
were assessed using Poisson generalized linear models 
that were scaled to account for overdispersion. The 
general form of the epidemiologic model is 



$$\log[E(Y_t)] = \alpha + \beta Z_t + \gamma \, \mathrm{confounders}_t \tag{11}$$



where $Y_t$ is the count of emergency department visits,
$Z_t$ is the mismeasured pollutant concentration, and
$\mathrm{confounders}_t$ is the vector of potential
confounders on day $t$. The specific potential confounders
included in the model were indicator variables for
day-of-week, season, and when a hospital entered or left the
study; cubic terms for maximum temperature and dew point; and
a cubic spline with monthly knots for day of follow-up.
Poisson regression yields $\alpha$ as the intercept, $\beta$
as the log of the rate ratio associated with a unit change in
pollutant concentration, and $\gamma$ as the vector of
regression coefficients for the suspected confounders
included in the model. The risk ratios (RR) per unit of
measurement change and per interquartile range (IQR) change
in pollutant concentration ($Z$) are given by eq. 12 and
eq. 13, respectively. 



$$\text{RR per unit} = e^{\beta} \tag{12}$$

$$\text{RR per IQR} = e^{\beta \cdot \mathrm{IQR}} \tag{13}$$
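The conversion between the two reporting bases is a one-line calculation. The sketch below applies it to two rounded NO2 values reported in Table 2 (RR per ppm and IQR), recovering the regression coefficient as the log of the per-unit RR; the function names are illustrative, and small discrepancies from the tabulated per-IQR values are expected because the inputs are rounded averages.

```python
import math

def rr_per_unit(beta):
    """Eq. 12: risk ratio for a one-unit change in concentration."""
    return math.exp(beta)

def rr_per_iqr(beta, iqr):
    """Eq. 13: risk ratio for an interquartile-range change in concentration."""
    return math.exp(beta * iqr)

# NO2 rows of Table 2: RR per ppm and average IQR of the simulated series.
beta_type_c = math.log(1.0011)
beta_type_b = math.log(1.0182)
rr_iqr_type_c = rr_per_iqr(beta_type_c, 1.84)
rr_iqr_type_b = rr_per_iqr(beta_type_b, 0.51)
print(round(rr_iqr_type_c, 4), round(rr_iqr_type_b, 4))
```

Because the IQR of a type C simulation is above 1 ppm and that of a type B simulation is below 1 ppm, the per-IQR risk ratio moves away from the per-unit value in opposite directions for the two error types.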



Using data from the central monitor, preliminary epi- 
demiologic assessments were performed for all air pollu- 
tants and ED visits for CVD. Consistent with previous 
findings [22], significant positive associations were found 
for several traffic-related pollutants, including NOx, CO 
and EC. For the measurement error analysis described 
here, we used 1-hr maximum CO data as our base case, 
representing in our analysis a true time-series and the 
measured risk ratio the true association. In this way, the 
exposure and health outcome values that we chose to 
represent true time-series have distributional characteris- 
tics expected of ambient air pollution and ED visit data. 
Simulations with measurement error added to the base 
case were used to evaluate the impact of measurement 
error on the epidemiologic analyses. A Monte Carlo 
approach was used to assess uncertainty. As already 
described, the relationship between this base case time- 
series and a simulated time-series is that expected of the 
average relationship between the true ambient time-ser- 
ies for all people and a population-weighted average 
time-series based on measurements in terms of error 
amount, with different error types evaluated. A percent 
attenuation in risk ratio (toward the null hypothesis of 1) 
is calculated as follows, with RR* representing the true 
risk ratio (obtained from the base case Poisson 
regression) and RR representing the risk ratio obtained 
using simulated population-weighted time-series. 



$$\text{percent attenuation in RR} = \frac{RR^* - RR}{RR^* - 1} \times 100\% \tag{14}$$
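A short worked example of this definition, using the base case RR* of 1.0139 and the rounded NO2 risk ratios per ppm reported in Table 2 (the function name is illustrative):

```python
def percent_attenuation(rr_true, rr_sim):
    """Eq. 14: attenuation toward the null (RR = 1), in percent.

    Negative values indicate a bias away from the null.
    """
    return (rr_true - rr_sim) / (rr_true - 1.0) * 100.0

RR_TRUE = 1.0139  # base case risk ratio per ppm

att_no2_type_c = percent_attenuation(RR_TRUE, 1.0011)
att_no2_type_b = percent_attenuation(RR_TRUE, 1.0182)
print(round(att_no2_type_c), round(att_no2_type_b))
```

The denominator measures the distance of the true risk ratio from the null, so a simulated risk ratio of exactly 1 corresponds to 100% attenuation and a simulated risk ratio equal to RR* corresponds to 0%.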



Results 

Distribution of Measurement Error Simulations 

Analysis of the distributions of correlation coefficients 
between the true log concentrations (i.e. the base case) 
and the simulated log concentrations, R(lnZ, lnZ*), for 
1000 simulations for each pollutant and each error type 
demonstrates that the simulations contain on average 
the desired amounts and types of error (Figure 2, see 
Additional file 4, Figure S2 for distribution of error type 
results). Wider distributions were observed for more 
spatially heterogeneous pollutants. 

Impact of Error on Health Risk Assessment 

For the base case of 1-hour maximum CO measure- 
ments and CVD outcomes, a RR per ppm of 1.0139 was 
observed, with a 95% confidence interval (CI) of 1.0078- 
1.0201 and a p-value of 0.000009. With an IQR of 1.00 
ppm, the RR per IQR and corresponding CI are the 



same as those on a per unit of measurement basis for 
our base case. For epidemiologic models using the time- 
series with simulated error added, the RR and CI results 
are not the same on a per measurement unit basis and a 
per IQR basis because the IQR of the simulated values is 
not 1. As expected, the simulated time-series with error 
type C has a greater IQR than the base case since this 
error is scattered about the true values, and the simu- 
lated time-series with error type B has a lower IQR than 
the base case since this error is scattered about the 
simulated values. Results of 1000 epidemiologic models 
for each of 12 air pollutants and two error scenarios are 
summarized in Table 2. The reported p-values represent 
those calculated from average z-score statistics and 95% 
confidence intervals were calculated using the asympto- 
tic standard error estimates obtained from the regres- 
sion model. 
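The relationship between a risk ratio, its confidence interval and the p-value can be sketched as follows. The helper names are hypothetical, and recovering the standard error from the width of a 95% Wald interval (using 1.96) is an assumption made here for illustration; with the rounded base case values it gives a p-value of the same order as the reported 0.000009.

```python
import math

def z_and_p_from_ci(rr, ci_low, ci_high):
    """Approximate z-score and two-sided p-value from an RR and its 95% CI.

    Assumes a symmetric Wald interval on the log scale: log(RR) +/- 1.96 * SE.
    """
    beta = math.log(rr)
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * 1.96)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal tail probability
    return z, p

def p_from_average_z(z_scores):
    """Two-sided p-value computed from the average z-score over simulations."""
    z_bar = sum(z_scores) / len(z_scores)
    return math.erfc(abs(z_bar) / math.sqrt(2.0))

z_base, p_base = z_and_p_from_ci(1.0139, 1.0078, 1.0201)
print(round(z_base, 2), f"{p_base:.1e}")
```

Averaging z-scores before converting to a p-value, as in `p_from_average_z`, summarizes a set of simulations with a single significance level rather than averaging the p-values themselves.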

When instrument imprecision and spatial variability error
were added as error type C, the average IQR of simulated
time-series was greater than the IQR of the base case for all
pollutants; for error type B, the average IQR of simulated
time-series was less than the IQR of the base case for all
pollutants. As expected, adding error to the base case
resulted in a reduction of significance (i.e. a higher
p-value) for both error types, as shown graphically in
Figure 3. The greater the amount of error (i.e. the greater
the population-weighted semivariance), the greater the
reduction in significance observed. Primary pollutants (SO2,
NO2/NOx, CO, and EC) had more error than secondary pollutants
and those of mixed origin (O3, SO4, NO3, NH4, PM2.5, OC, and
PM10) due to greater spatial variability. Regarding error
type, there was a greater reduction of statistical
significance when error type was modeled as type C than when
error type was modeled as type B. For NO2 and SO2, which have
the largest amount of measurement error, there was a loss of
significance (p-value > 0.05) when error was modeled as error
type C. 

Figure 2 Boxplots of R(lnZ, lnZ*), with expected correlation coefficients shown in parentheses, for 1000 simulated time-series of error type C (top panel) and type B (bottom panel).

Table 2 Summarized epidemiologic model results with the magnitude of error representative of error associated with using a population-weighted average for each pollutant added to the base case (RR* = 1.0139, 95% CI = 1.0078-1.0201, p-value = 0.000009, IQR = 1.00 ppm)

| Pollutant | RR per ppm (95% CI) | IQR (ppm) | RR per IQR (95% CI) | p-value |
|---|---|---|---|---|
| Error Type C simulations | | | | |
| 1-hr max NO2 | 1.0011 (0.9998-1.0023) | 1.84 | 1.0020 (0.9997-1.0042) | 0.0957 |
| 1-hr max NOx | 1.0024 (1.0003-1.0046) | 1.51 | 1.0037 (1.0005-1.0070) | 0.0251 |
| 8-hr max O3 | 1.0114 (1.0060-1.0169) | 1.05 | 1.0120 (1.0063-1.0178) | 0.00004 |
| 1-hr max SO2 | 1.0011 (0.9998-1.0023) | 1.84 | 1.0019 (0.9997-1.0042) | 0.0966 |
| 1-hr max CO | 1.0021 (1.0002-1.0040) | 1.57 | 1.0033 (1.0003-1.0063) | 0.0342 |
| 24-hr PM10 | 1.0063 (1.0025-1.0102) | 1.20 | 1.0076 (1.0030-1.0122) | 0.0013 |
| 24-hr PM2.5 | 1.0094 (1.0045-1.0142) | 1.10 | 1.0103 (1.0049-1.0156) | 0.000157 |
| 24-hr PM2.5-SO4 | 1.0107 (1.0054-1.0159) | 1.07 | 1.0114 (1.0058-1.0170) | 0.000066 |
| 24-hr PM2.5-NO3 | 1.0079 (1.0035-1.0123) | 1.14 | 1.0090 (1.0040-1.0141) | 0.00040 |
| 24-hr PM2.5-NH4 | 1.0076 (1.0033-1.0119) | 1.15 | 1.0088 (1.0038-1.0137) | 0.00050 |
| 24-hr PM2.5-EC | 1.0032 (1.0006-1.0057) | 1.42 | 1.0045 (1.0009-1.0081) | 0.0140 |
| 24-hr PM2.5-OC | 1.0068 (1.0028-1.0108) | 1.18 | 1.0080 (1.0033-1.0128) | 0.00090 |
| Error Type B simulations | | | | |
| 1-hr max NO2 | 1.0182 (1.0041-1.0325) | 0.51 | 1.0092 (1.0021-1.0165) | 0.0112 |
| 1-hr max NOx | 1.0169 (1.0056-1.0284) | 0.61 | 1.0103 (1.0034-1.0172) | 0.0034 |
| 8-hr max O3 | 1.0142 (1.0075-1.0208) | 0.94 | 1.0133 (1.0070-1.0195) | 0.000027 |
| 1-hr max SO2 | 1.0182 (1.0041-1.0325) | 0.51 | 1.0092 (1.0021-1.0164) | 0.0114 |
| 1-hr max CO | 1.0172 (1.0053-1.0292) | 0.59 | 1.0101 (1.0031-1.0171) | 0.0044 |
| 24-hr PM10 | 1.0152 (1.0068-1.0236) | 0.78 | 1.0117 (1.0053-1.0182) | 0.00030 |
| 24-hr PM2.5 | 1.0144 (1.0073-1.0217) | 0.88 | 1.0127 (1.0064-1.0190) | 0.000074 |
| 24-hr PM2.5-SO4 | 1.0143 (1.0074-1.0211) | 0.92 | 1.0130 (1.0068-1.0193) | 0.000039 |
| 24-hr PM2.5-NO3 | 1.0147 (1.0071-1.0225) | 0.83 | 1.0122 (1.0059-1.0186) | 0.000152 |
| 24-hr PM2.5-NH4 | 1.0148 (1.0070-1.0226) | 0.82 | 1.0121 (1.0058-1.0185) | 0.000175 |
| 24-hr PM2.5-EC | 1.0165 (1.0060-1.0271) | 0.65 | 1.0106 (1.0038-1.0174) | 0.0021 |
| 24-hr PM2.5-OC | 1.0150 (1.0069-1.0232) | 0.79 | 1.0119 (1.0055-1.0183) | 0.00030 |

Risk ratio results for the two error types are plotted in 
Figure 4 on a percent attenuation basis. RR per unit of 
measurement decreased, and attenuation increased, with 
increasing error added (i.e. increasing population-weighted 
semivariance) when the error was of type C. However, RR 
per unit increased, with increasing bias away from the 



null, with increasing error added when error was of type 
B. For NO2 and SO2, which had the most measurement 
error, the attenuation was 92% when modeled as error 
type C and biased away from the null by 31% when mod- 
eled as error type B. On a per IQR basis, variation in the 
RR estimates between error types was much less dramatic. 
Both error types C and B led to lower RR estimates (i.e. 
bias towards the null). For NO2 and SO2, which again had 
the most measurement error, the attenuation was 86% 
when modeled as type C and 34% when modeled as type 
B error. For error type B there was a wider distribution of 
results than for type C error. 

To assess a range of error types, simulations were generated
with values of $\sigma_{\ln Z}/\sigma_{\ln Z^*}$ ranging from
that of error type C to that of type B (eq. 10) for the case
of an amount of error representative of CO
($\bar{\gamma}' = 0.411$). Epidemiologic model results for RR
attenuation are shown in Figure 5. On a per unit of
measurement (ppm) basis, RR attenuation increased from -24%
(i.e. a bias away from the null) for type B error to 85% for
type C error. On a per IQR basis, RR attenuation increased
from 28% for type B error to 85% for type C error. It is
interesting to note that at the value of
$\sigma_{\ln Z}/\sigma_{\ln Z^*}$ for which the error
($Z - Z^*$) is independent of $Z$ (i.e.
$R(Z - Z^*, Z) = 0$), the RR per unit attenuation is 0. This
is the expected result when error is the Berkson type on an
unlogged basis. 

Figure 3 P-values versus population-weighted semivariance. Half-bars denote standard deviations for 1000 error simulations. 

Discussion 

The results demonstrate that error type affects the 
reduction in significance as well as the RR estimate in 
the epidemiologic analysis. Moreover, the results 
demonstrate a profound effect of error type on the RR 
estimate per unit of measurement. The RR per unit of 
measurement estimate is increased by the presence of 
type B error; that is, there is a bias away from the null. 
To better understand these results, we estimate the 
attenuation in the effect estimator $\beta$ (eq. 11) in the 



absence of confounders from the first-order linear 
regression coefficient (m) of error (Z-Z*) versus Z as fol- 
lows. 



$$\frac{\beta}{\beta^*} = 1 - m \tag{15}$$



For RR estimates near 1 (i.e. $\beta$ values near 0) as is
the case in this study, the predicted attenuation in RR is
approximately given as follows. 

$$\text{RR per unit attenuation} \approx m \tag{16}$$

$$\text{RR per IQR attenuation} \approx 1 - (1 - m) \frac{\mathrm{IQR}}{\mathrm{IQR}^*} \tag{17}$$



Epidemiologic model results are compared with the 
predictions of eq. 16 and eq. 17 for all pollutants and 
both error types (Figure 6). The degree to which the 
epidemiologic results differ from these predictions likely 
indicates the degree to which confounding variables are 
affecting results. As shown by the 1:1 line in Figure 6, 
there is strong agreement between the attenuation pre- 
dicted by analysis of the error model results (i.e. m and 
IQR) and that obtained from the epidemiologic model. 
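The prediction from the error model alone can be reproduced on synthetic data. The sketch below is an illustration under stated assumptions, not the study's analysis: it uses the CO amount of error from Table 1, a hypothetical log mean and log standard deviation for the base case, independent (not autocorrelated) errors, the type B form $\chi_t = (\chi_t^* + \varepsilon_{\chi t})/(1 + \sigma_{err}^2)$, and no confounders. The slope m is obtained by regressing the error (Z - Z*) on Z.

```python
import math
import random

def slope(x, y):
    """Least-squares slope of y regressed on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

def iqr(values):
    """Interquartile range with linear interpolation between order statistics."""
    s = sorted(values)
    def quantile(p):
        pos = p * (len(s) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(s) - 1)
        return s[lo] + (pos - lo) * (s[hi] - s[lo])
    return quantile(0.75) - quantile(0.25)

def predicted_attenuation(z_sim, z_true):
    """Eqs. 16 and 17: predicted RR attenuation per unit and per IQR."""
    error = [s - t for s, t in zip(z_sim, z_true)]
    m = slope(z_sim, error)  # regression of error (Z - Z*) on Z
    per_unit = m
    per_iqr = 1.0 - (1.0 - m) * iqr(z_sim) / iqr(z_true)
    return per_unit, per_iqr

rng = random.Random(3)
n = 50000
sigma_err = 1.18            # CO value from Table 1
mu_ln, sigma_ln = 0.0, 0.5  # hypothetical log mean and log standard deviation

chi_true = [rng.gauss(0.0, 1.0) for _ in range(n)]
eps = [sigma_err * rng.gauss(0.0, 1.0) for _ in range(n)]
chi_c = [c + e for c, e in zip(chi_true, eps)]                             # eq. 3
chi_b = [(c + e) / (1.0 + sigma_err ** 2) for c, e in zip(chi_true, eps)]  # eq. 4

def to_conc(chi):
    return [math.exp(c * sigma_ln + mu_ln) for c in chi]                  # eq. 5

z_true = to_conc(chi_true)
unit_c, iqr_c = predicted_attenuation(to_conc(chi_c), z_true)
unit_b, iqr_b = predicted_attenuation(to_conc(chi_b), z_true)
print(f"type C: per unit {unit_c:.2f}, per IQR {iqr_c:.2f}")
print(f"type B: per unit {unit_b:.2f}, per IQR {iqr_b:.2f}")
```

Under these assumed parameters the slope is positive for type C error (attenuation) and negative for type B error (bias away from the null) on a per unit basis, while both error types are attenuated on a per IQR basis, which is the qualitative pattern reported in the Results.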

In this study, in which quantification of error is based 
on the variability between monitors, error due to spatial 
variation is much greater than error due to instrument 
imprecision, particularly for primary air pollutants [15]. 
Conceptually, therefore, we speculate that this error is 
more likely of the Berkson type, with true values varying 
randomly about a population-weighted average repre- 
sented by the base case. If spatial error is best described 
by the Berkson-like type defined on a log basis (our 
error type B) and the mean of the measurements is the 
same mean as the true values, we estimate there to be a
24% to 34% attenuation in RR per IQR estimates (Figure 4,
right panel), and a 19% to 31% bias away from the null in RR
estimates on a per unit of measurement basis (Figure 4, left
panel), for the primary pollutants studied (SO2, NO2/NOx, CO,
and EC) when using a population-weighted average as the
exposure metric. For the secondary pollutants and pollutants
of mixed origin (O3, SO4, NO3, NH4, PM2.5, OC, and PM10), we
estimate a 5% to 15% attenuation in RR per IQR estimates and
a 2% to 9% bias away from the null in RR estimates on a per
unit of measurement basis. We are currently investigating
different methods for estimating actual error type based on
simulated pollutant fields trained to have all of the
characteristics, including the pattern of spatial
autocorrelation, expected of true pollutant fields. 

Figure 4 Percent attenuation in risk ratio per ppm (left panel) and per IQR (right panel) due to error versus population-weighted semivariance. Bars denote standard deviations for 1000 error simulations. Pollutant labels are in order of increasing population-weighted semivariance. 

Figure 5 Percent attenuation in risk ratio per unit of measurement (ppm) and per IQR for CO error simulations ($\bar{\gamma}' = 0.411$) with incremental changes in error type ranging from type B ($\sigma_{\ln Z}/\sigma_{\ln Z^*} = 0.65$) to type C ($\sigma_{\ln Z}/\sigma_{\ln Z^*} = 1.55$). Bars denote standard deviations for 1000 simulations. 

This study addresses error between measured and
true ambient concentrations. Our results are consistent
with previous findings suggesting that Berkson error,
as defined on an unlogged scale (additive), produces
no bias in the effect estimate [8,11], as shown in Figure
5; however, Berkson-like error defined on a log basis
(multiplicative) can lead to risk ratio estimates per unit
increase that are biased away from the null (although
with a reduction in significance). Thus, the direction
and magnitude of the bias are functions of error type.
With the multiplicative error structure used here in 
conjunction with a linear dose response, large "true" 
values of air pollution would likely be underestimated, 
resulting in an overestimation of pollution health 
effects. We have shown how multiple air pollution 
measurements over space can be used to quantify the 
amount of error and provide a strategy for evaluating 
impacts of different types of this error. The results
suggest that estimating the impacts of measurement error
on health risk assessment is particularly important
when comparing results across primary and secondary
pollutants, as the corresponding error will vary widely
in both amount and type depending on the degree of
spatial variability. These results are suggestive of the error
impacts one would expect in time-series studies in
which a single measure, such as the
population-weighted average, is used to characterize an urban or
regional population exposure. The methodology used
here can be applied to other study areas to quantify
this type of measurement error and its
impacts on health risk estimates.
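The contrast drawn above between additive Berkson error and Berkson-like error on a log basis can be illustrated with the regression coefficient m of eq. 15. This is a hedged sketch only: the helper `error_slope`, the lognormal series, and the error standard deviations are hypothetical, and the log-scale error is assumed to have zero mean on the log scale, which may differ from the construction of error type B used in this study.

```python
import numpy as np

def error_slope(z, z_star):
    """Slope m of the first-order regression of the error (Z - Z*) on Z."""
    return np.polyfit(z, z - z_star, 1)[0]

rng = np.random.default_rng(1)
n = 200_000
z = np.exp(rng.normal(0.0, 0.5, n))    # hypothetical measured series Z

# Additive Berkson error: Z* = Z + u, with u independent of Z.
# (Z* may go negative here; the series is purely illustrative.)
z_star_add = z + rng.normal(0.0, 0.3, n)
m_add = error_slope(z, z_star_add)

# Berkson-like error on the log scale: ln Z* = ln Z + e, with e independent
# of ln Z and zero mean on the log scale (an assumption of this sketch).
z_star_log = z * np.exp(rng.normal(0.0, 0.5, n))
m_log = error_slope(z, z_star_log)

print(f"additive Berkson:  m = {m_add:+.3f}, predicted ratio 1 - m = {1 - m_add:.3f}")
print(f"log-scale Berkson: m = {m_log:+.3f}, predicted ratio 1 - m = {1 - m_log:.3f}")
```

Under these assumptions m is close to zero for the additive case (no predicted bias), whereas it is negative for the log-scale case, so the predicted ratio 1 - m exceeds 1 and the per-unit estimate is pushed away from the null.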
Figure 6 Attenuation in the risk ratio per unit of measurement (left panel) and per IQR (right panel) due to the introduction of
measurement error, modeled both as type B and type C error. Ranges denote standard deviations for 1000 simulations. One-to-one line is
also shown.
Conclusions

Health risk estimates of exposure to ambient air pollu- 
tion are impacted by both the amount and the type of 
measurement error present, and these impacts vary sub- 
stantially across pollutants. By modeling combined 
instrument imprecision and spatial variability over a 
range of error types, we are able to estimate a range of 
effects of these sources of measurement error, which are 
likely a mixture of both classical and Berkson error 
types. This study demonstrates the potential impact of 
measurement error in an air pollution epidemiology 
time-series study and how this impact depends on error 
type and amount. 